(54) Method and apparatus for encoding video shape and texture information



(57) An encoding apparatus and method adjust tex- 
ture information for a video object based on original 
shape information for the video object. The original 
shape information is not encoded shape information or 
information that has been encoded and then decoded. 
In a preferred embodiment, the texture information is 
adjusted by 0 padding pels near the boundary of the 
video object in an inter mode, or by lowpass extrapola- 
tion filtering pels near the boundary of the video object 
in an intra mode where the boundary of the video object 
is defined by the original shape information. Lossy 
shape encoding can therefore be performed while 
decreasing the overall encoded bit rate, improving the 
subjective appearance of the decoded video object and 
decreasing computational complexity of the encoder 
and/or encoding method. Adaptive scanning and sub- 
sampling of macroblocks can be avoided while having 
little effect on overall bitrate and significantly decreasing 
computational complexity of the encoding process. 



FIG. 10 (flow chart): step 100, identify video object in
a current video frame; step 200, generate shape information
that defines an original shape for the video object;
step 300, use the shape information to adjust texture
information for the video object; step 400, encode the
adjusted texture information.






EP0 932 306 A2 






Description 

BACKGROUND OF THE INVENTION 

1. Field of Invention

[0001] This invention relates to encoding video texture
information, especially encoding information consistent 
with the MPEG-4 standard. 


2. Description of Related Art 

[0002] MPEG-4 Visual, that part of the upcoming
MPEG-4 standard describing the coding of natural and
synthetic video signals, allows the encoding of video
objects using motion, texture and shape information.
Video objects, e.g., distinct bodies or portions in an
image, are identified and the outline, i.e., shape, of the
video object is identified. In one aspect of MPEG-4 Visual,
shape information for a video object in the form of a
bitmap is coded, and then the coded shape information
is used to pad texture information and to code the
texture information.

SUMMARY OF THE INVENTION

[0003] Using coded shape information to pad and
code texture information can cause problems, such as
increasing the overall bit rate when lossy coded shape
information is transmitted. The inventor has discovered
that using original shape information, i.e., uncoded
shape information, to pad texture information before
coding makes shape distortion less visible and
decreases the overall data rate, especially when the
shape information is lossily coded.
[0004] The invention provides a method and apparatus
for coding video information wherein original shape
information for a video object is used to adjust texture
information before the texture information is coded. In a
preferred embodiment consistent with the MPEG-4
standard, shape information is generated by first identifying
a video object in a video frame, or portion of a
frame, and generating a bitmap defining an outline of
the video object in a current time instant, i.e., a current
video object plane (VOP). This unencoded shape information
is then used to determine which area(s) of the
texture prediction error or other texture parameters,
e.g., texture information for the current VOP before the
texture prediction error is determined, for the current
VOP should be padded. The unpadded texture prediction
error is determined based on padded reference
VOP texture information, predicted texture motion information
for the current VOP and unpadded texture information
for the current VOP. The padded reference VOP
texture information and the predicted texture motion
information for the current VOP are used to generate
predicted texture information for the reference VOP. The
difference between the predicted texture information for
the reference VOP and the texture information for the
current VOP is the texture prediction error.
[0005] The invention also provides techniques for 
smoothing object boundaries to provide a more predict- 
able quality of lossily encoded shapes. According to one 
aspect of the invention, filtering, such as morphological 
filtering, is used to smooth object boundaries of a video 
object before encoding in order to decrease or maintain 
an overall bitrate for the transmitted data. The filtering 
can be performed on motion-compensated shape pre- 
diction error using an unencoded, or original, shape as 
a reference. Thus, a change in an object's topology can 
be avoided while maintaining a subjective appearance 
of the decoded video object and minimizing the overall 
bitrate of encoded image data. Morphological filtering 
can be performed on an entire VOP or on each bound- 
ary block of an object. 

[0006] The invention also provides techniques for sig- 
nificantly decreasing the computational complexity of 
encoding video information while having little effect on
the overall bit rate and/or subjective image quality.
Lossy shape coding can be done without subsampling
with little effect on subjective image quality or overall bit
rate. Similarly, the encoder can operate so that only hor- 
izontal scanning of macroblocks is enabled during 
shape coding, rather than allowing adaptive scanning, 
i.e., adaptively switching between horizontal and verti- 
cal scanning. Using only horizontal scanning usually 
increases the shape bitrate by approximately 1-8%, but
significantly decreases the computational complexity of 
the encoder and has little influence on the overall 
bitrate. 

[0007] These and other aspects of the invention will 
be apparent or obvious from the following description. 

BRIEF DESCRIPTION OF THE DRAWINGS 

[0008] The invention is described in detail with regard 
to the following figures wherein like numerals reference 
like elements, and wherein: 

Fig. 1 is a schematic block diagram of an encoding 
apparatus; 

Fig. 2 shows an example video image; 

Fig. 3 shows a video object from the Fig. 2 image
and corresponding bounding box;
Fig. 4 is a schematic block diagram of a portion of
the parameter coding module of the Fig. 1 encoding
apparatus;

Fig. 5 shows an example reference VOP and cur- 
rent VOP; 

Fig. 6 shows one boundary macroblock from Fig. 5;
Figs. 7 and 8 show templates for determining a 
shape context for pels; 

Fig. 9 shows an exemplary video object and a cor- 
responding original shape, predicted shape and 
shape after coding/decoding; and 
Fig. 10 shows a flow chart of steps of a method for 
coding texture information in accordance with the
invention. 

DETAILED DESCRIPTION OF PREFERRED EMBODIMENTS

[0009] The invention is described in connection with 
video data encoding in accordance with the MPEG-4 
standard. However, the invention can be used with other 
compression techniques. In addition, although a brief 
description of MPEG-4 encoding is provided below to 
give context to the invention, a detailed description of all 
of the aspects of MPEG-4 encoding is not provided 
since the details of standard MPEG-4 encoding are well 
known to those of skill in the art. 
[0010] MPEG-4 Visual allows the transmission of arbitrarily
shaped video objects (VO) in a video sequence.
As discussed above, a time instant of a VO is a video 
object plane (VOP), which can be a rectangular video 
frame or a part of a video frame. 
[0011] Fig. 1 is a schematic block diagram of an 
encoding apparatus 10 for encoding video data consist- 
ent with the MPEG-4 standard. To encode data for a VO, 
data representing a current VOP S for the VO is 
received and processed by an image analysis module 
1. For example, data for the tree VO in the image shown
in Fig. 2 could be provided to the image analysis module
1. The image analysis module 1 creates a bounding box
11 for the current VOP for the tree VO, as shown in Fig.
3. The bounding box 11 is a rectangular box of minimum
size that completely contains the VO. Each side of the
bounding box 11 has a length equal to a multiple of 16
pels, i.e., the macroblock size. (The boxes shown in Fig.
3 are macroblocks, not pels.) Three different types of
macroblocks inside the bounding box can be identified: 
transparent macroblocks which are positioned outside 
of a VO, opaque macroblocks which are positioned 
inside of a VO, and boundary macroblocks which are 
positioned at the boundary of the VO and include por- 
tions both inside and outside of the object. The image 
analysis module 1 transmits shape, texture, shape 
motion and texture motion parameters to a parameter 
coding module 2, which produces encoded texture, 
motion and shape information in a single bit stream for 
each VO. Shape motion and texture motion information 
is generated with respect to a prior reference VOP S' 
which has been previously processed, encoded by the 
parameter coding module 2, decoded by a decoding 
module 3, stored in a VOP memory 4, and fed back to 
the image analysis module 1 and the parameter coding 
module 2. 
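
The bounding box and macroblock classification described above can be illustrated with a minimal sketch. It assumes the shape is available as a binary NumPy bitmap; the function names `bounding_box` and `classify_macroblocks` are hypothetical and are not part of the described apparatus.

```python
import numpy as np

MB = 16  # macroblock size in pels


def bounding_box(mask):
    """Tightest box around the object, with each side rounded up to a
    multiple of the macroblock size. Returns (top, left, height, width)."""
    rows = np.flatnonzero(mask.any(axis=1))
    cols = np.flatnonzero(mask.any(axis=0))
    top, left = int(rows[0]), int(cols[0])
    height = -(-(int(rows[-1]) - top + 1) // MB) * MB  # ceiling to multiple of 16
    width = -(-(int(cols[-1]) - left + 1) // MB) * MB
    return top, left, height, width


def classify_macroblocks(mask):
    """Label every macroblock in the bounding box as transparent, opaque
    or boundary. Pels of the box that fall outside the frame are treated
    as lying outside the object (an assumption of this sketch)."""
    mask = mask.astype(bool)
    top, left, h, w = bounding_box(mask)
    box = np.zeros((h, w), dtype=bool)
    src = mask[top:top + h, left:left + w]
    box[:src.shape[0], :src.shape[1]] = src
    labels = []
    for r in range(0, h, MB):
        row = []
        for c in range(0, w, MB):
            block = box[r:r + MB, c:c + MB]
            if not block.any():
                row.append("transparent")
            elif block.all():
                row.append("opaque")
            else:
                row.append("boundary")
        labels.append(row)
    return labels


# Example: an L-shaped object in a 48x48 frame.
frame = np.zeros((48, 48), dtype=bool)
frame[0:16, 0:32] = True
frame[16:24, 0:16] = True
labels = classify_macroblocks(frame)
```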

[0012] Based on the shape, texture and motion infor- 
mation from the image analysis module 1 and the VOP 
memory 4, the parameter coding module 2 predictively 
codes the shape, texture, shape motion and texture 
motion parameters for the VO. As shown in Fig. 4, texture
motion and shape motion parameters are coded by
coding modules 21 and 22, respectively. The shape
parameters are coded by a coding module 23 based on
uncoded shape motion information and information
regarding the reference VOP S'. Shape motion parameters
are coded by the coding module 22 based on texture
motion information and coded shape information
from the coding module 23.

[0013] Coding of texture information in an inter mode 
is performed by first padding the reference VOP S' in a 
padding module 24. For motion compensated prediction 
of the texture of the current VOP S, the reference VOP 
S' is motion compensated using overlapped block 
motion compensation. Then, the boundary and trans- 
parent blocks of the reference VOP are padded. Bound- 
ary blocks are padded using repetitive padding where 
boundary pels are first replicated in the horizontal direc- 
tion, and then in the vertical direction. If a value can be 
assigned to a pel by both padding directions, an aver- 
age value is assigned to the pel. A simpler mean pad- 
ding is used in a second step where transparent 
macroblocks bordering boundary blocks are assigned
an average value determined from the pels of their
neighboring padded blocks. For example, Fig. 5 shows a
reference VOP S' that has been repetitively padded and
padded using a constant value process. Blocks shown
in Fig. 5 are macroblocks, not individual pels. The current
VOP S related to the reference VOP S' is also
shown in Fig. 5.
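
The repetitive padding of a boundary block can be sketched as follows. This is one reading of the text, not the normative procedure: within each pass, a pel that has object pels on both sides receives the average of the two nearest values, and the vertical pass fills only rows that contain no object pels. The names `pad_1d` and `repetitive_pad` are hypothetical.

```python
import numpy as np


def pad_1d(values, defined):
    """Fill undefined entries of one line from the nearest defined entries.
    Returns the filled line and whether the line had any defined entry."""
    out = values.astype(float).copy()
    idx = np.flatnonzero(defined)
    if idx.size == 0:
        return out, False
    for i in np.flatnonzero(~defined):
        left = idx[idx < i]
        right = idx[idx > i]
        if left.size and right.size:
            # reachable from both sides: assign the average
            out[i] = (values[left[-1]] + values[right[0]]) / 2.0
        elif left.size:
            out[i] = values[left[-1]]
        else:
            out[i] = values[right[0]]
    return out, True


def repetitive_pad(block, mask):
    """Horizontal pass first, then vertical pass over the remaining rows."""
    mask = mask.astype(bool)
    out = block.astype(float).copy()
    for r in range(out.shape[0]):
        out[r], _ = pad_1d(out[r], mask[r])
    # After the horizontal pass, every row holding an object pel is complete.
    row_done = mask.any(axis=1)
    for c in range(out.shape[1]):
        out[:, c], _ = pad_1d(out[:, c], row_done)
    return out


# Example: two object pels in one row of a 4x4 block.
block = np.zeros((4, 4))
mask = np.zeros((4, 4), dtype=bool)
block[1, 1], block[1, 3] = 10, 20
mask[1, 1] = mask[1, 3] = True
padded = repetitive_pad(block, mask)
```

A block without any object pel is returned unchanged here; in the text such transparent blocks are handled by the separate mean padding step.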

[0014] When the reference VOP S' has been padded, 
texture information for the current VOP S is predicted in 
a prediction module 25 based on texture motion infor- 
mation for the current VOP S and the padded reference 
VOP S'. The difference between the texture information 
for the current VOP S and the predicted texture informa- 
tion is the texture prediction error provided to a second 
padding module 26. Based on the prediction error and 
original shape information for the current VOP S, the 
second padding module 26 pads the prediction error. As 
used herein, the term original shape information refers 
to unencoded shape information, and not shape infor- 
mation derived from encoded shape information. 
[0015] The texture prediction error is padded using the
original shape parameters so that in the inter mode, 
padding of the prediction error is preferably done using 
a "0 padding" technique. Since the prediction error in 
the inter mode has a mean of 0, 0 padding, i.e., padding 
pels outside of the object (as defined by the original 
shape information) with a 0 value, is very appropriate. In 
the intra mode, a lowpass extrapolation filter is prefera- 
bly used. For a preferred lowpass extrapolation filter, the 
average signal value for the VO in each boundary block 
is assigned to pels outside of the object as defined by 
the original shape information. Then, for each pel out- 
side of the object, the average of each pel's 4 neighbor- 
ing pels is assigned. This process is started in the top 
left corner of each macroblock and is repeated in scan 
order. The result is a smooth extrapolation of the texture 
signal at the object boundary. As an example, Fig. 6
shows a single boundary macroblock for a VO. Pels on
the outside of the object's boundary are 0 padded in the
inter mode. In the intra mode, pels outside of the object
are assigned an average signal value for pels in the
boundary block. Then, for pels outside of the object, the
average of each pel's 4 neighboring pels is assigned.
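
The two padding techniques can be sketched as below, with the original (unencoded) shape supplied as a binary mask. The function names are hypothetical, and the handling of neighbors that fall outside the block (they are simply left out of the average) is an assumption of this sketch rather than something the text specifies.

```python
import numpy as np


def zero_pad_error(pred_error, orig_mask):
    """Inter mode: prediction-error pels outside the original shape become 0."""
    return np.where(orig_mask.astype(bool), pred_error, 0)


def lowpass_extrapolate(block, orig_mask):
    """Intra mode: fill outside pels with the object mean, then replace each
    outside pel, in scan order from the top left, by the average of its
    available 4-neighbors."""
    inside = orig_mask.astype(bool)
    out = block.astype(float).copy()
    if not inside.any():
        return out
    out[~inside] = out[inside].mean()
    h, w = out.shape
    for r in range(h):
        for c in range(w):
            if inside[r, c]:
                continue
            nbrs = [out[rr, cc]
                    for rr, cc in ((r - 1, c), (r + 1, c), (r, c - 1), (r, c + 1))
                    if 0 <= rr < h and 0 <= cc < w]
            out[r, c] = sum(nbrs) / len(nbrs)
    return out


# Example on a tiny 2x2 "block" whose left column belongs to the object.
err = np.array([[3, -2], [1, 5]])
shape = np.array([[True, False], [True, False]])
padded_err = zero_pad_error(err, shape)

tex = np.array([[4.0, 0.0], [8.0, 0.0]])
padded_tex = lowpass_extrapolate(tex, shape)
```

Because the filter runs in place, a pel that is visited later sees the already filtered values of its upper and left neighbors, which is what produces the smooth fall-off away from the boundary.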
[0016] Once the prediction error or other texture infor- 
mation is padded, the prediction error or other texture 
information is coded by a coding module 27 based in 
part on coded shape information from the coding mod- 
ule 23. Coded parameters are output by the multiplexer 
28. 

[0017] Additional padding of the texture parameters 
for the current VOP can also be performed in an 
optional padding module 29 based on original shape 
information. This padding can be performed in addition 
to padding performed in the second padding module 26 
or in place of padding in the second padding module 26. 
[0018] Coding of the shape information is performed 
by a context-based arithmetic coder (not shown) in the 
coding module 23. As discussed above, three different 
types of macroblocks are identified: transparent mac- 
roblocks which are positioned outside of a VO, opaque 
macroblocks which are positioned inside of a VO, and 
boundary macroblocks which are positioned at the 
boundary of the VO and include portions both inside 
and outside of the object. Two types of VOs are possi- 
ble: opaque objects and transparent objects. For 
opaque objects, shape information in the boundary 
blocks is encoded as a bitmap. For transparent objects, 
binary shape information and a transparency value are 
preferably used. The transparency can also be 
described by a gray-scale alpha map defining the out- 
line as well as the varying transparency of an object. 
However, the outline of transparent objects in the 
boundary blocks is also encoded as a bitmap using the 
binary shape coder. 

[0019] For boundary macroblocks in the intra mode, a 
template of 10 pels shown in Fig. 7 is used to define the 
causal context for predicting the shape value of the cur- 
rent pel. The template extends up to 2 pels to the left, to 
the right and to the top of the pel to be coded. Thus, for 
encoding the pels in the left ends of the top two rows of 
a macroblock, parts of the template are defined by 
shape information of already transmitted macroblocks 
positioned above and to the left of the current macrob- 
lock. For the 2 right-most columns, undefined pels of the 
context are set to the value of its closest neighbor inside 
the current macroblock. 

[0020] To increase coding efficiency and allow lossy 
shape coding, a macroblock can be subsampled by a 
factor of 2 or 4, which results in a sub-block of 8x8 or 
4x4 pels, respectively. The sub-block is encoded and
the subsampling factor is transmitted to the decoder so 
that the decoder can decode the shape information and 
upsample the decoded sub-block to macroblock size. 
Depending on the upsampling filter, the decoded shape 
can appear somewhat blocky. A preferred upsampling 
filter in terms of subjective picture quality is an adaptive 



non-linear upsampling filter. However, subsampling is 
preferably not used by the encoder because subsam- 
pling has no predictable effect on the overall bit rate and 
significantly increases the computational complexity of 
the encoder and decoder. 

[0021] In addition, the encoder can transpose a mac- 
roblock prior to encoding, although this type of switching 
between horizontal and vertical scanning increases the 
computational complexity of the encoder while decreas- 
ing the bitrate, on average, by only 1%. Thus, the 
encoder preferably operates so that horizontal macrob- 
lock scanning is always performed, thus decreasing the 
computational complexity of the encoder and having lit- 
tle effect on, i.e., increase of, the overall bit rate. 
[0022] Fig. 8 shows a template used by the encoder in 
the inter mode to define the context for pels in boundary 
blocks. To exploit the temporal redundancy in the shape 
information, motion compensation is used so that a 2D 
integer pel motion vector is estimated using a full search 
for each macroblock to minimize the prediction error 
between the coded shape of the reference VOP S' and 
the shape of the current VOP S. The shape motion vec- 
tors are predictively encoded with respect to the shape 
motion vectors of neighboring macroblocks. If no shape 
motion vector is available, texture motion vectors are 
used as predictors. The shape motion vector of the cur- 
rent block is used to align the template shown in Fig. 8 
for coding the shape information. Four pels of the con- 
text are neighbors of the pel to be coded, and 5 pels of 
the context are located in the motion compensated posi- 
tion on the reference VOP S'. 
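
A full search for an integer-pel shape motion vector can be sketched as follows, assuming binary shape bitmaps and counting mismatched pels as the prediction error. The function name, the search range, and the tie-break toward shorter vectors are assumptions made for illustration.

```python
import numpy as np


def shape_motion_search(cur_mask, ref_mask, top, left, size=16, search=4):
    """Return ((dy, dx), error) minimising the number of pels that differ
    between the current macroblock and the displaced reference block."""
    cur = cur_mask[top:top + size, left:left + size]
    h, w = ref_mask.shape
    best = None
    for dy in range(-search, search + 1):
        for dx in range(-search, search + 1):
            r, c = top + dy, left + dx
            if r < 0 or c < 0 or r + size > h or c + size > w:
                continue  # candidate block would leave the reference frame
            cand = ref_mask[r:r + size, c:c + size]
            err = int(np.count_nonzero(cand != cur))
            key = (err, abs(dy) + abs(dx))  # prefer short vectors on ties
            if best is None or key < best[0]:
                best = (key, (dy, dx))
    return best[1], best[0][0]


# Example: the object moved 2 pels down and 3 pels right between frames.
ref = np.zeros((32, 32), dtype=bool)
ref[8:16, 8:16] = True
cur = np.zeros((32, 32), dtype=bool)
cur[10:18, 11:19] = True
vector, error = shape_motion_search(cur, ref, top=8, left=8)
```

The vector points from the current macroblock to the matching position in the reference, so the example yields a negative displacement.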

[0023] The subsampling and transposition options dis- 
cussed above are also available in the inter mode. How- 
ever, as discussed above, subsampling and 
transposition are preferably not used. For lossy shape 
coding, the encoder can also determine that the shape 
representation achieved by just carrying out motion 
compensation is sufficient, thus avoiding coding of the 
prediction error. 

[0024] MPEG-4 controls lossy shape coding by using 
an alpha threshold, which defines the maximum number 
of incorrectly coded pels in a macroblock. Thus, the 
topology of the shape can change, especially when iso- 
lated pels at the object boundary are coded as part of 
the object. Using morphological filters or other similar 
filtering to smooth object boundaries before shape cod- 
ing can provide a much more predictable quality of a 
lossily encoded shape. The filtering can be implemented
by the image analysis module 1, in the coding
module 23, or in another module before the shape
information is encoded. When performed in the coding
module 23, the filtering can be performed on the
motion-compensated shape prediction error using the original
shape of the object as a reference. Thus, a change in an 
object's topology can be avoided while maintaining a 
subjective appearance of the decoded video object and 
minimizing the overall bitrate of encoded image data. 
Morphological filtering can be performed on an entire 
VOP or on each boundary block of an object.
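
The topology problem can be seen in a small sketch. It takes the alpha threshold, as described above, to be a per-macroblock limit on the number of wrong pels; the function name `within_alpha_threshold` is hypothetical.

```python
import numpy as np


def within_alpha_threshold(orig_block, coded_block, alpha_threshold):
    """Return (accepted, error_count) for one macroblock."""
    errors = int(np.count_nonzero(orig_block.astype(bool) != coded_block.astype(bool)))
    return errors <= alpha_threshold, errors


# A fully opaque macroblock in which a single pel is coded as transparent.
orig = np.ones((16, 16), dtype=bool)
coded = orig.copy()
coded[8, 8] = False  # one wrong pel opens a hole inside the object
accepted, errors = within_alpha_threshold(orig, coded, alpha_threshold=4)
```

The block is accepted because only one pel is wrong, yet the coded object now contains a hole. A pure error count does not see this change in topology.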
[0025] If morphological filtering is performed in the 
image analysis module 1, the object shape could be 
smoothed by applying an Erosion filter followed by a 
Dilation filter or a median filter, or a morphological 
Opening or Closing. (The morphological operations 
Erosion, Dilation, Opening and Closing are well known 
to those skilled in the art and are not described in detail 
here.) The result of this filtering is not the uncoded 
shape as it is defined in this invention, but just the input 
to the shape coder. 
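
For readers who want a concrete picture of the operations named above, the following is a minimal NumPy sketch of binary Erosion, Dilation, Opening and Closing with a 3x3 structuring element. The structuring element and the treatment of pels beyond the frame edge (taken as outside the object) are assumptions of the sketch.

```python
import numpy as np


def erode(mask):
    """A pel stays set only if its whole 3x3 neighborhood is set."""
    m = mask.astype(bool)
    p = np.pad(m, 1, constant_values=False)
    h, w = m.shape
    out = np.ones((h, w), dtype=bool)
    for dy in range(3):
        for dx in range(3):
            out &= p[dy:dy + h, dx:dx + w]
    return out


def dilate(mask):
    """A pel becomes set if any pel in its 3x3 neighborhood is set."""
    m = mask.astype(bool)
    p = np.pad(m, 1, constant_values=False)
    h, w = m.shape
    out = np.zeros((h, w), dtype=bool)
    for dy in range(3):
        for dx in range(3):
            out |= p[dy:dy + h, dx:dx + w]
    return out


def opening(mask):
    """Erosion followed by Dilation: removes small isolated regions."""
    return dilate(erode(mask))


def closing(mask):
    """Dilation followed by Erosion: fills small holes."""
    return erode(dilate(mask))


# Example: a 4x4 square plus one isolated pel, and the same square with a hole.
square = np.zeros((8, 8), dtype=bool)
square[2:6, 2:6] = True
noisy = square.copy()
noisy[0, 7] = True
holed = square.copy()
holed[3, 3] = False
```

Opening `noisy` removes the isolated pel and closing `holed` fills the hole, so both recover the plain square.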

[0026] If morphological filtering is performed in the 
shape coding module 23, filtering is done on a macrob- 
lock basis although neighboring blocks may be consid- 
ered. The purpose of the filtering is to enable lossy 
shape coding while maintaining high subjective quality.
In a preferred embodiment, the filter compares the orig- 
inal shape with the motion-compensated predicted 
shape. In a first step of the comparison, regions are 
marked that add holes or disconnected regions to the 
object, such as portions 901 and 902 shown in Fig. 9. 
Next, pels are marked that have a distance larger than a 
given number of pels from the original shape. As an 
example, for a threshold of 1 pel, the pels identified by 
the portion 903 are marked. When lossy shape coding, 
the coder 23 preferably only corrects the marked pels. 
In this example, the result is that after encoding and 
decoding, the object has a shape shown in Fig. 9. 
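
The second marking step (pels farther than a threshold from the original shape) can be sketched as below. The first step, detecting added holes and disconnected regions, is not covered. Distance is measured here by repeated 3x3 dilation and erosion of the original shape, i.e., as a chessboard distance, which is an assumption; the function names are hypothetical.

```python
import numpy as np


def _shift_combine(mask, combine_all):
    """3x3 erosion (combine_all=True) or dilation (combine_all=False)."""
    p = np.pad(mask, 1, constant_values=False)
    h, w = mask.shape
    out = np.full((h, w), combine_all, dtype=bool)
    for dy in range(3):
        for dx in range(3):
            win = p[dy:dy + h, dx:dx + w]
            out = (out & win) if combine_all else (out | win)
    return out


def mark_far_pels(orig, pred, threshold=1):
    """Mark wrongly predicted pels lying more than `threshold` pels away
    from the boundary of the original shape."""
    orig = orig.astype(bool)
    pred = pred.astype(bool)
    grown, shrunk = orig, orig
    for _ in range(threshold):
        grown = _shift_combine(grown, combine_all=False)
        shrunk = _shift_combine(shrunk, combine_all=True)
    extra_far = pred & ~grown      # added pels far outside the object
    missing_far = ~pred & shrunk   # dropped pels deep inside the object
    return extra_far | missing_far


# Example: original 4x4 square and a prediction with four wrong pels.
orig = np.zeros((12, 12), dtype=bool)
orig[4:8, 4:8] = True
pred = orig.copy()
pred[3, 5] = True    # 1 pel outside: tolerated
pred[1, 5] = True    # 3 pels outside: marked
pred[4, 4] = False   # boundary pel dropped: tolerated
pred[5, 5] = False   # interior pel dropped: marked
marked = mark_far_pels(orig, pred, threshold=1)
```

Only the marked pels would then be corrected by the lossy shape coder, leaving the small boundary deviations uncoded.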
[0027] The encoding apparatus 10 is a general pur- 
pose data processing system, which can be a general 
purpose computer, or network of general purpose com- 
puters, and other associated devices, including commu- 
nications devices, modems, and/or other circuitry or 
components necessary to perform the desired 
input/output or other functions. The encoding apparatus 
10 can also be implemented, at least in part, as a single 
special purpose integrated circuit (e.g., ASIC) or an 
array of ASICs, each having a main or central processor 
section for overall, system-level control, and separate 
sections dedicated to performing various different spe- 
cific computations, functions and other processes under 
the control of the central processor section. The encod- 
ing apparatus 10 can also be implemented using a plu- 
rality of separate dedicated programmable integrated or 
other electronic circuits or devices, e.g., hardwired electronic
or logic circuits such as discrete element circuits
or programmable logic devices. 
[0028] The VOP memory 4 can be one or more vola- 
tile and/or non-volatile memory devices, such as optical 
disk, magnetic media, semiconductor or other memory 
devices. 

[0029] The image analysis module 1, parameter coding
module 2, decoding module 3, the other coding
modules 21, 22, 23 and 27, the padding modules 24 
and 26 and/or the prediction module 25 can be imple- 
mented as software modules that are executed by the 
encoding apparatus 10 or any other suitable data 
processing apparatus. Alternatively, these modules can
be implemented as hard-wired electronic circuits or
other programmed integrated or other electronic circuits
or devices, e.g., hardwired electronic or logic circuits
such as discrete element circuits or programmable logic
devices.

[0030] Fig. 10 is a flow chart of steps of a method for
encoding texture information for a video object in a
video image. In step 100, a video object in a video frame
or other image data is identified. In step 200, shape
information that defines the original shape of the video
object is generated. That is, the shape information
defines the shape or outline of the video object as represented
in the raw incoming data. The original shape
information is not shape information derived by encoding
and then decoding the video object's shape or outline.
For example, a bitmap or other data defining the
outline of the video object can be generated. In step
300, the shape information is used to adjust texture
information for the video object. For example, the shape
information can be used to identify and pad texture values
for pels outside of the video object. Such padding
can include "0 padding" described above, or other techniques,
such as averaging pel values for portions of the
image near the video object or in other portions of the
image. In step 400, the adjusted texture information is
encoded, for example, using a DCT or any other transform
or encoding technique.

[0031] The above method can also include, e.g.,
between steps 200 and 300, the optional step of generating
shape information that approximates the original
shape of the video object. For example, this step could
include generating lossy encoded shape information.
[0032] While this invention has been described in conjunction
with specific embodiments thereof, it is evident
that many alternatives, modifications, and variations will
be apparent to those skilled in the art. Accordingly, preferred
embodiments of the invention as set forth herein
are intended to be illustrative, not limiting. Various
changes may be made without departing from the spirit
and scope of the invention.

[0033] Where technical features mentioned in any
claim are followed by reference signs, those reference
signs have been included for the sole purpose of
increasing the intelligibility of the claims and accordingly,
such reference signs do not have any limiting
effect on the scope of each element identified by way of
example by such reference signs.

Claims 


1. A method for encoding texture information for a
video object, comprising:

identifying a video object in image data representing
at least an original shape and texture
information for the video object;
generating original shape information that represents
an original shape of the video object;
adjusting texture information in the image data
using the original shape information; and
encoding the adjusted texture information.

2. The method of claim 1, wherein the step of generating
original shape information comprises generating
a bitmap defining the shape of the video object.

3. The method of claim 1, wherein the step of generating
original shape information comprises identifying
transparent, boundary and opaque macroblocks of
pels of the video object.

4. The method of claim 1, wherein the step of adjusting
texture information comprises padding a texture
prediction error for pels using a 0 padding technique.

5. The method of claim 1, wherein the step of adjusting
texture information comprises padding texture
parameters for pels using a lowpass extrapolation
filter.

6. The method of claim 1, wherein the step of adjusting
texture information comprises replicating
boundary pels in a horizontal direction, replicating
boundary pels in a vertical direction, and assigning
an average value of neighboring pels to transparent
macroblocks bordering boundary blocks.

7. The method of claim 1, further comprising lossy 
encoding shape information for the video object. 

8. The method of claim 1, further comprising lossy 
encoding shape information without subsampling. 

9. The method of claim 1, further comprising encoding
shape information without vertical scanning of
macroblocks.

10. The method of claim 1, further comprising smooth- 
ing a boundary of the video object using filtering. 

11. The method of claim 10, wherein the step of 
smoothing a boundary of the video object using fil- 
tering comprises one of using morphological filter- 
ing on an entire VOP and using morphological 
filtering on each boundary block of a VOP. 

12. The method of claim 10, wherein the step of 
smoothing a boundary of the video object using fil- 
tering comprises using morphological filtering on a 
motion-compensated shape prediction error using 
the original shape information as a reference to 
avoid change of topology of the video object, mini- 
mize bitrate and maintain subjective image quality. 

13. A computer-readable recording medium containing
image data that has been encoded according to the
method of claim 1.

14. An encoding apparatus comprising:

an image analysis module that generates
shape, texture and motion information from
image data representing at least a video object;
a parameter coding module that adjusts texture
information generated by the image analysis
module based on original shape information
from the image analysis module and encodes
the adjusted texture information and shape and
motion information;
a decoding module that decodes encoded texture,
shape and motion information from the
parameter coding module; and
a memory that stores decoded data from the
decoding module.

15. The encoding apparatus of claim 14, wherein the
image analysis module generates original shape
information comprising a bitmap defining a shape
of the video object.

16. The encoding apparatus of claim 14, wherein the
parameter coding module adjusts texture information
by one of padding a texture prediction error for
pels using a 0 padding technique and using a
lowpass extrapolation filter.

17. The encoding apparatus of claim 14, wherein the
parameter coding module adjusts texture information
by replicating boundary pels in a horizontal
direction, replicating boundary pels in a vertical
direction, and assigning an average value of neighboring
pels to transparent macroblocks bordering
boundary blocks.

18. The encoding apparatus of claim 14, wherein the
parameter coding module performs lossy shape
coding.

19. The encoding apparatus of claim 14, wherein the
parameter coding module performs lossy shape
coding without subsampling.

20. The encoding apparatus of claim 14, wherein the
parameter coding module performs shape coding
without vertical scanning of macroblocks.

21. The encoding apparatus of claim 14, further com- 
prising a smoothing module that smoothes a 
boundary of the video object using filtering. 

22. The encoding apparatus of claim 21, wherein the
smoothing module applies one of morphological filtering
on an entire VOP and morphological filtering
on each boundary block of a VOP.

23. The encoding apparatus of claim 21, wherein the
smoothing module applies morphological filtering
on a motion-compensated shape prediction error
using the original shape information as a reference
to avoid change of topology of the video object,
minimize bitrate, and maintain subjective image
quality.

24. An encoding apparatus, comprising:

identifying means for identifying a video object
in image data representing at least an original
shape and texture information for the video
object;
generating means for generating original
shape information that represents an original
shape of the video object;
adjusting means for adjusting texture information
in the image data using the original shape
information; and
encoding means for encoding the adjusted texture
information.